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Abstract 
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^ ' Following a strategy recently developed by Ivan Nourdin and Giovanni Peccati, we provide a general 

technique to compare the tail of a given random variable to that of a reference distribution. This enables 
us to give concrete conditions to ensure upper and/or lower bounds on the random variable's tail of 
various power or exponential types. The Nourdin-Peccati strategy analyzes the relation between Stein's 
method and the Malliavin calculus, and is adapted to dealing with comparisons to the Gaussian law. By 
studying the behavior of the solution to general Stein equations in detail, we show that the strategy can 
be extended to comparisons to a wide class of laws, including many Pearson distributions. 
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1 Introduction 

In this article, following a strategy recently developed by Ivan Nourdin and Giovanni Peccati, we provide a 
^\ ' general technique to compare the tail of a given random variable to that of a reference distribution, and apply 

j3 ■ it for all reference distributions in the so-called Pearson class, which enables us to give concrete conditions to 

ensure upper and/or lower bounds on the random variable's tail of power or exponential type. The strategy 
uses the relation between Stein's method and the Malliavin calculus. In this introduction, we detail the 
main ideas of this strategy, including references to related works; we also summarize the results proved in 
this article, and the methods used to prove them. 

1.1 Stein's method and the analysis of Nourdin and Peccati 

Stein's method is a set of procedures that is often used to measure distances between distributions of random 
variables. The starting point is the so-called Stein equation. To motivate it, recall the following result which is 
sometimes referred to as Stein's lemma. Suppose X is a centered random variable. Then X ^ Z ^ A/'(0, 1) 
if and only if 

E[/'(X) - Xf{X)] = (1) 
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for all continuous and piecewise differentiable functions / such that E[|/'(X)|] < cx) (see e.g. [1], [S], |21)). 
If the above expectation is non-zero but close to zero, Stein's method can give us a way to express how close 
the law of X might be to the standard normal law, in particular by using the concept of Stein equation. 
For a given test function /i, this is the ordinary differential equation /' [x) — xf (x) = /i (x) — E [/i (Z)] 
with continuous and piecewise differentiable solution /. As we will see in more detail and greater generality 
further below, if one is able to prove boundedness properties of / and /' for a wide class of test functions /, 
this can help evaluate the distance between the law of Z and laws of random variables that might be close to 
Z, including methods for proving convergence in distribution. This fundamental feature of Stein's method 
is described in many works; see |4] for a general introduction and review. 

As a testament to the extraordinary versatility of Stein's method, recently Ivan Nourdin and Giovanni 
Peccati discovered a connection between Stein's method and the Malliavin calculus, with striking applications 
in a number of problems in stochastic analysis. Motivated by Berry-Esseen-type theorems for convergence 
of sequences of random variables in Wiener chaos, Nourdin and Peccati's first paper [H] on this connection 
considers an arbitrary square-integrable Malliavin-differentiable random variable X on a Wiener space, and 
associates the random variable 

G -.^ {DX;-DL-^X) (2) 

where D is the Malliavin derivative operator on the Wiener space, and L^^ is the pseudo- inverse of the 
generator of the Ornstein-Uhlenbeck semigroup (see Section 13.11 for precise definitions of these operators) . 
One easily notes that if X is standard normal, then G = 1. Then by measuring the distance between G 
and 1 for an arbitrary X, one can measure how close the law of X is to the normal law. The connection to 
Stein's method comes from their systematic use of the basic observation that E [Gf {X)] — E [Xf (X)]. It 
leads to the following simple and efficient strategy for measuring distances between the laws of X and Z. 
To evaluate, e.g., E [/i(X)] — E [h{Z)] for test functions h, one can: 

1. write E [h{X)] - E [h{Z)] using the solution of Stein's equation, as E [/' (X)] - E [Xf (X)]; 

2. use their observation to transform this expression into E [/' {X) (1 — G)]; 

3. use the boundedness and decay properties of /' (these are classically known from Stein's equation) to 
exploit the proximity of G to 1. 

As we said, this strategy of relating Stein's method and the Malliavin calculus is particularly useful for 
analyzing problems in stochastic analysis. In addition to their study of convergence in Wiener chaos in [S], 
which they followed up with sharper results in [9], Nourdin and Peccati have implemented several other 
applications including: the study of cummulants on Wiener chaos [11], of fluctuations of Hermitian random 
matrices [12j . and, with other authors, other results about the structure of inequalities and convergences on 
Wiener space, such as [3], [13], [14], [15]. In [16], it was pointed out that if p denotes the density of X, then 
the function 

/•OO 

g(z) := p-\z) / ypiy)dy, (3) 



which was originally defined by Stein in "^ , can be represented as 

g(z) = E[G|X = z], 

resulting in a convenient formula for the density p, which was then exploited to provide new Gaussian lower 
bound results for certain stochastic models, in [16] for Gaussian fields, and subsequently in [22] for polymer 
models in Gaussian and non-Gaussian environments, in [18] for stochastic heat equations, in f3] for statistical 
inference for long-memory stochastic processes, and multivariate extensions of density formulas in [T]. 



1.2 Summary of our results 

Our specific motivation is drawn from tire results in j22j which make assumptions on how G conrpares to 
1 almost surely, and draw conclusions on how the tail of X, i.e. P [X > z], compares to the normal tail 
F [Z > z]. By the above observations, these types of almost-sure assumptions are equivalent to comparing 
the deterministic function g to the value 1. For instance, one result in |22| can be summarized by saying that 
(under some additional regularity conditions) if G > 1 almost surely, i.e. if g (z) > 1 everywhere, then for 
some constant c and large enough z , P [X > z] > cP [Z > z]. This result, and all the ones mentioned above, 
concentrate on comparing laws to the standard normal law, which is done by comparing G to the constant 
1, as this constant is the "G" for the standard normal Z. 

In this paper, we find a framework which enables us to compare the law of X to a wide range of laws. 
Instead of assuming that g is comparable to 1, we only assume that it is comparable to a polynomial of degree 
less than or equal to 2. In [21], Stein had originally noticed that the set of all distributions such that their 
g is such a polynomial, is precisely the so-called Pearson class of distributions. They encompass Gaussian, 
Gamma, and Beta distributions, as well as the inverse-Gamma, and a number of continuous distributions 
with only finitely many moments, with prescribed power tail behavior. This means that one can hope to give 
precise criteria based on g, or via Malliavin calculus based on G, to guarantee upper and/or lower bounds 
on the tail P [X > z], with various Gaussian, exponential, or power- type behaviors. We achieve such results 
in this paper. 

Specifically, our first set of results is in the following general framework. Let Z he a reference random 
variable supported on (a, 6) where — oo < a < b < +oo, with a density p* which is continuous on R and 
differentiable on (a, &). The function g corresponding to p* is given as in ^, and we denote it by 5, (the 
subscripts * indicate that these are relative to our reference r.v.): 

9* (2) = T-s l(a,b)(^)- (4) 

P*[Z) 

We also use the notation 

$, (z) ^P[Z> z] 

for our reference tail. Throughout this article, for notational convenience, we assume that Z is centered 
(except when specifically stated otherwise in Section [5^ in the Appendix). Let X be Malliavin-differentiable, 
supported on (a, b), with its G := {DX; —DL^^X) as in ^. 



• 



(Theorem [T^ Under mild regularity and integrability conditions on Z and X , li G > g^ {X) almost 
surely, then for all z < b, 



P[X >z]><^,{z)--^ f {2y - z)P[X > y] dy, 



• 



where 

Q (z) :== z^ - Z5* (z) + .g* (z) ; (5) 

typically Q is of order z^ for large z. 

(Theorem [T5|) Under mild regularity and integrability conditions on Z and X, if G < g* {X) almost 
surely, then for some constant c and all large enough z < b, 

P[X> z] <c<P,{z). 



These results are generalizations of the work in |22| . where only the standard normal Z was considered. 
They can be rephrased by referring to g as in ([31), which coincides with 5 (z) = E \G\X = z], rather than 
G; this can be useful to apply the theorems in contexts where the definition of X as a member of a Wiener 
space is less explicit than the information one might have directly about g. We have found, however, that 
the Malliavin-calculus interpretation makes for efficient proofs of the above theorems. 

The main application of these general theorems are to the Pearson class: Z such that its g^ is of the 
form g* (z) — az^ + /3z + 7 in the support of Z . Assume b — +00, i.e. the support of Z is (a, +cxd). Assume 
E [l^p] < 00 (which is equivalent to a < 1/2). Then the lower bound above can be made completely 
explicit, as can the constant c in the upper bound. 

• fCorollarvl2ip Under mild regularity and integrability conditions on X [including assuming that there 
exists c > 2 such that g {z) < z^jc for large z], if G > g* [X) almost surely, then for any c' < 
1/ (1 + 2(1 — a)(c — 2)) and all z large enough, 

P[X>z] >c!^^{z). 

• (Corollary [^n]) Under mild regularity and integrability conditions on X, if G < 5* (X) almost surely, 
then for any c > (1 — a)/(l — 2q;), and all z large enough, 

V\X> z\ <C$,(2). 

The results above can be used conjointly with asymptotically sharp conclusions when upper and lower 
bound assumptions on G are true simultaneously. For instance, we have the following, phrased using g's 
instead of G's. 



• 



(Corollary[521 point 2) On the support (a, +00), let 5, {z) = q:z^ + /3z + 7 and let g^ (z) = az^ + /3z + 7 
with non-zero a and a. If for the Malliavin-differentiable X and its corresponding 5, we have for all 
z > a, gt, (z) < 5 (z) < 5* (z), then there are constants c and c such that for large z, 



cz 



-1-1/a < pj^ ^ ^j < cz-1-1/". 



• (see Corollary [23]) A similar result holds when a = a = 0, in which P[X > z] compares to the 
Gamma-type tail z~'^~^^^ exp(— z//3). 

The strategy used to prove these results is an analytic one, following the initial method of Nourdin and 
Peccati, this time using the Stein equation relative to the function g^ defined in ([4]) for a general reference 
r.v. Z: 

g, (x) /' {x) - xf {x) = h{x)--E[h {Z)] . 

Our mathematical techniques are based on a careful analysis of the properties of g^, its relation to the 
function Q defined in ([5]), and what consequences can be derived for the solutions of Stein's equation. 
The basic general theorems' proofs use a structure similar to that employed in |22j. The applications to 
the Pearson class rely heavily on explicit computations tailored to this case, which are facilitated via the 
identification of Q as a useful way to express these computations. 

This article is structured as follows. Section 2 gives an overview of Stein's equations, and derives some 
fine properties of their solutions by referring to the function Q. These will be crucial in the proofs of our 
general upper and lower bound results, which are presented in Section 3 after an overview of the tools of 
Malliavin calculus which are needed in this article. Applications to comparisons with Pearson distributions, 
with a particular emphasis on tail behavior, including asymptotic results, are in Section 4. Section 5 is an 
Appendix containing the proofs of some technical lemmas and some details on Pearson distributions. 

This article is dedicated to the memory of Professor Paul Malliavin. 



2 The Stein equation 

2.1 Background and classical results 

Characterization of the law of Z. As before, let Z be centered with a differentiable density on its 
support (a, &), and let g* be defined as in ^. Nourdin and Peccati (Proposition 6.4 in [5]) collected the 
following results concerning this equation. If / is a function that is continuous and piecewise continuously 
differentiable, and if E[|/'(Z)|g*(Z)] < oo. Stein (Lemma 1, p. 59 in f5T]) proved that 

E[5,(Z)/'(Z)-Z/(Z)]-0 

(compare this with ([T]) for the special case Z ~ A/'(0, 1)). Conversely, assume that 



dz — CO and / — —^ dz — — oo. (6) 

9*[z) J a 9*{z) 

If a random variable X has a density, and for any differentiable function / such that x M- \g*{x) f {x)\ + \x f {x)\ 
is bounded, 

n9*iX)f'iX)-Xf{X)]^0 (7) 

then X and Z have the same law. In other words, under certain conditions, ^ can be used to characterize 
the law of a centered random variable X as being equal to that of Z. 

Stein's equation, general case; distances between distributions. If /i is a fixed bounded piece- 
wise continuous function such that E[|/i(Z)|] < oo, the corresponding Stein equation for Z is the ordinary 
differential equation in / defined by 

h{x) - nnZ)] - 9*ix)f'{x) - xf{x). (8) 

The utility of such an equation is apparent when we evaluate the functions at X and take expectations: 

E[h{X)] - E[HZ)] = n9*{X)f{X) - Xf{X)]. (9) 

The idea is that if the law of X is "close" to the law of Z, then the right side of (|9]) would be close to 0. 
Conversely, if the test function h can be chosen from specific classes of functions so that the left side of ([9]) 
denotes a particular notion of distance between X and Z, the closeness of the right-hand side of ([9]) to zero, 
in some uniform sense in the /'s satisfying Stein's equation (|S]) for all the ft,'s in that specific class of test 
functions, will imply that the laws of X and Z are close in the corresponding distance. For this purpose, it 
is typically crucial to establish boundedness properties of / and /' which are uniform over the class of test 
functions being considered. 

For example, liH —{h : ||^||l-I-||/i||oo < 1} where \\-\\l is the Lipschitz seminorm, then the Fortet-Mourier 
distance dpM {X, Z) between X and Z is defined as 

dpAiiX, Z) = sup |E[/i(X)] - nh(Z)]\- 
hen 

This distance metrizes convergence in distribution, so by using properties of the solution / of the Stein 
equation ([9]) for h (z H, we can draw conclusions on the convergence in distribution of a sequence {^n} to 
Z. See 8 and [6 for details and other notions of distance between random variables. 

Solution of Stein's equation. Stein (Lemma 4, p. 62 in 21 ) proved that if ([6|) is satisfied, then his 
equation fS]) has a unique solution / which is bounded and continuous on (a, b). li x ^ (a, 6), then 



while if a; G (a, b), 

f{x)^ {h{y)-^[h{Z)])'^^^dy. (11) 

J a 9*\y) 

2.2 Properties of solutions of Stein's equations 

We assume throughout that p, is differentiable on (a, 6) and continuous on R (for which it is necessary that 
p* be null on R— (a, b) ). Consequently, 5, is differentiable and continuous on (a, b). The next lemma records 
some elementary properties of g* , such as its positivity and its behavior near a and b. Those facts which 
are not evident are established in the Appendix. All are useful in facilitating the proofs of other lemmas 
presented in this section, which are key to our article. 

Lemma 1 Let Z be centered and continuous, with a density p* that is continuous on R and differentiable 
on its support {a,b), with a and b possibly infinite. 

1. 5, (x) > if and only if x £ (a, b); 

2. 5, is differentiable on {a,b) and we have [g^:{x)p^:{x)y ~ —xp^{x) therein; 

3. lim g^{x)p^{x) — lim g,(a;)/9,(a;) = 0. 

A different expression for the solution / of Stein's equation ^ than the one given in (fTOlfTTj) . which will 
be more convenient for our purposes, such as computing /' in the support of Z , was given by Schoutens |20j 
as stated in the next lemma. 

Lemma 2 For all x G (a, b), 



/(^) - g^(^)^^(^) / {h{y)^'^[h{Z)])p,{y)dy. (12) 



If X ^ [a^b], differentiating \10j) gives 



_^/(^) ^ -xh'{x)+h{x)-^[h{Z)] 
x"^ 

while if X € {a,b), differentiating hl2fl gives 

f'ia:) = --^^— rihiy)-nh{Z)])pMdy+^^^^^^^^. (14) 

[9*ix)rP*ix) J a 9*[X) 

The proof of this lemma (provided in the Appendix for completeness) also gives us the next one. 

Lemma 3 Under our assumption of differentiability on (a, 6) 0/ p* and hence of g^, Stein's condition (0) 
on 5* is satisfied. 

In Stein's equation (jS]), the test function h = l(_oo.z] lends itself to useful tail probability results since 
E[/i(Z)] = P[Z < z]. From this point on, we will assume that h = l(-oo,z] with fixed z > 0, and that / is 
the corresponding solution of Stein's equation (we could denote the parametric dependence of / on z by /^ , 
but choose to omit the subscript to avoid overburdening the notation) . 

As opposed to the previous lemmas, the next two results, while still elementary in nature, appear to be 
new, and their proofs, which require some novel ideas of possibly independent interest, have been kept in the 
main body of this paper, rather than having them relegated to the Appendix. We begin with an analysis of 
the sign of /', which will be crucial to prove our main general theorems. 



Lemma 4 Suppose < z < b. If x < z, then f'{x) > 0. If x > z, then f'{x) < 0. 



Proof. The result follows easily from dTS]) when x ^ [a, b]: ii x < a, then f'{x) = (1 - E[h{Z)]) jx^ > 0, 
while if X > 6, then f'{x) = —E[h{Z)]/x'^ < 0. So now we can assume that x G (a,b). We will use the 
expression for the derivative /' given in p^ . 

Suppose a < x < z. Then h(x) — 1 and for any y < x, h{y) = 1 so 



/'(•^) = 



Lg*(a;)]2p4a;) J^ 

i-nnz)] 



{l-E[h(Z)])p,{y)dy- 



i-E[M^)] 



Clearly, f'{x) > if x > 0. Now define 

ni{x) := 



X / P*iy)dy + g^{x)p^{x) 



P*{y)dy 



g^{x)p*{x) 



We will show that xni{x) > when a; < 0. Since 

x[g^{x)p^{x)]' - 5*(a;)/9*(x) 



n'l (x) = p* (x) + 
= p,(x) + 



-a;^p*(x) - 5»(a:)p*(x) 9*{x)p^{x) 



<0 



g.(a:)p(x) 



then rii is non-increasing on (a, 0) which means that whenever a < a: < 0, 'ni{x) < lim ni(x) = lim 

since lim g^,{x)p^,{x) = 0. Therefore, a;ni(x) > for x < 0. This completes the proof that /'(x) > when- 

ever x < z. 

Finally, suppose that z < x < 6 so h{x) = 0. Since E[/i(Z)] = P[Z < z] — J^ p*{y) dy , 

xE[h{Z)] /"^ ^ ^^.^ ^^ E[/i(Z)] 



/'(^) 



X 



[5*(x)]2p,(x 

X 



[5*(x)]2p4x 
E[/i(Z)] 



[5H.(x)]2p,(x 



[5H.(x)]2p4x 



Hy)p*{y)dy 

P*{y)dy 
nh{Z)] - 



[9*i.x)f P*{x) J a 

xE[h(Z)] 



[9*{x)?P*{x) J a 

xE[/i(Z)] 



P*{y)dy 
P*{y)dy- 



9*{x) 

nKz)] 



P*iy)dy^ 



[g*(x)]2p4x) J^ 
X - X I pt{y)dy - g* (x)/?* (x) 

xn2(x) 



9*{x) 
g*{x) 



where 



It is enough to show that n2(x) < since x > z > 0. Since n.2(x) = —n'i{x) > 0, then 7^2(2;) < lim Ti2(x) = 



f \ 1 / / \^ .g,(x)pH.(x) 
n2(x) := 1 - / /9,(y) dy = 1 - ?ii(x). 



g.(a;)p,(a:) _ 



x—>b 



1 — lim / p*{y) dy — lim S'^^Jp*'.-^-' = g because lim g^,{x)pf,{x) = 0. Therefore, /'(x) < if x > z, finishing 

x^b ^ x—^b ^ x-^b 

the proof of the lemma. ■ 



As alluded to in the previous subsection, of crucial importance in the use of Stein's method, is a quanti- 
tatively explicit boundedness result on the derivative of the solution to Stein's equation. We take this up in 
the next lemma. 



Lemma 5 Recall the function 

Q[x) :— x^ — xg'.^{x) + g^{x) 

defined in Q), for all x E Tl except possibly at a and b. Assume that g'l{x) < 2 for all x and that ^ qi'X 
tends to a finite limit as x ^f a and as x -^ b. Suppose < z < b. Then f'{x) is bounded. In particular, if 
a < X < z, 

< fix) < ' + ^ < oo, (15) 

[g.4z)Yp4z) Q(0) 

while if b > X > z, 

-oo<-^</'(x)<0. (16) 

To prove this lemma, we need two auxiliary results. The first one introduces and studies the function 
Q which we already encountered in the introduction, and which will help us state and prove our results in 
an efficient way. The second one shows the relation between Q, g*, and the tail $* of Z, under conditions 
which will be easily verified later on in the Pearson case. 

Lemma 6 

1. If X ^ {a,b), then Q{x) ^ x^ > 0. 

2. If g* is twice differ entiable in {a,b) (for example, when p* is twice differentiable) , then Q'{x) — 
x{2-g':{x)). 

3. If moreover g'l{x) < 2 everywhere in (a, b), a reasonable assumption as we shall see later when Z is a 
Pearson random variable, then Ta[Ti^a,b)Q = Q (0) so that Q{x) > Q(0) — g*{0) > 0. 

Lemma 7 With the assumptions on g^ and Q as in Lemma\^ then for all x, 

max(x-g^a;),0) 
Q{x) 

and 



g^{x)p^{x) < $H.(a:) (17) 



max g^x -x,0) , x / ^ . -, ^ / ^ ,.o\ 

-— g4x)p4x)<l-(S>4x). 18) 

Q(x) 

Moreover for < x < b, we have 

<^*{x) < - ■ g^{x)p^{x) (19) 

x 

while if a < X < 0, then 

1 - $, (.t) < g* (x)p* (x) . (20) 

— X 

Proof of Lemma H If a; < a with a > -oo, then f'{x) == (1 - F,[h{Z)]) /x'^ < (1 - E[/i(Z)]) /a^. 
If a; > fe with 6 < oo, then f'{x) = —'E[h{Z)]/x^ > —Ei[h{Z)]/b^. So now we only need to assume that 
X G (a, &). 

Suppose a < X < z. Use f'{x) > given in p^ : 



i-nhjz)] ( r \ 

\g^[x)Yp^(x) V A / 

X 1 

- \gM?p.{x) ^^ ' '^^^'^^^ + 1^) 



When X > 0, we can rewrite the upper bound as: 



where 



fix) < r{x) + 



r{x) 



9*{x) 



x<^^{x) 
g4x)p^{x) 



[9*ixWP*{x) g*ix)[g^{x)p^{x)] ' 



We can bound r(x) above since 



r'(x) = 



[g*{x)]'^p^{x) - X [g^x] {-xp^jx)) + g*{x)p*{x)gi{x)] 

[g.ix)]^p.ixf 
g^{x) + x^ - xg'^ix) _ Q{x) 



>0 



[9*{x)Yp*{x) [g*{x)fp*{x) 

so r{x) <r{z). To bound [1 — x<^^{x) / {g^{x)p^{x))] / g^{x), use (IT71) of Lemnia[71 



1 


\ x^*{x) 
g*ix)p^{x)_ 


g*{x) 
1 

9*{x) 
1 


[l 


1 


g*{x) 


g*{x)p^{x) 
■ x'^~xg'^{x)' 

[ Q{x) \ 

9*{x) _ 1 ^ 



g.{x) Q{x) Q{x) - Q(0) 



Therefore, 



fix) < 



[g.{z)Yp.[z) Q(0)- 



When a; < 0, we use P^ of Lemma [7J 
f\x) < 



(l-$,(a;)) + 



1 



< 



[9*{xWp*{x) '^ -"y-" ■ g^(^^^ 
X gi (x) ~ X 



[5*(x)]2p,(x) Q{x) 
1 xgl{x) - x^ 1 



g4x)p^{x) + 



9*{x) 



g^{x) Q{x) g*{x) 

xg^x) -x^ + Q{x) 



g*{x) 



< 



Q{x) 
1 



1 1 

< 



Q{x) - Q(0) 



[g.{z)Yp.(z) Q(0)- 
Now we prove (|16p and so suppose a; > z > 0. From the proof of Lemma SI 

nKz)] 



fix) 



[g*{xWp*{x) 

nh{z)] 

[g*{x)Yp^{x) 



X - X p^{y)dy - g^ (x)p* (a;) 
{x^^{x) - g^{x)p^{x)) . 



We conclude by again using ([T7|) of Lemma [3 to get 

-f'{x) < — -- {g^{x)p^{x) - x<i>^{x)) 

[9*[xjrP*[x) 

- r f M2 — TT 9* (^)^* {x)-x- * ■ g., {x)p^ {x) 

[g4x)Yp4x) V Q{x) 

1 f_^ a;2 -xg^a;)^ _ 1 g^{x) 



g*{x) \ Q{x) J g^{x) Q{x) 



1 1 

< 



Q{x) - Q{z) 



Remark 8 Since z > 0, and Q (z) > Q (0), Leninia\^ implies the following convenient single hound for any 
fixed 2 > 0, uniform, for all x e (a, b): 

\nx)\ < ' ^ 



[g,{zWp,{z) Q(0)- 

Occasionally, this will be sufficient for some of our purposes. The more precise bounds in Lemma\B[will also 
be needed, however. 

3 Main results 

In order to exploit the boundedness of /', we adopt the technique pioneered by Nourdin and Peccati, to 
rewrite expressions of the form E[Xto(X)] where m is a function, using the Malliavin calculus. For ease of 
reference, we include here the requisite Malliavin calculus constructs. Full details can be found in [17 ; also 
see |22l Section 2] for an exhaustive summary. 

3.1 Elements of Malliavin calculus 

We assume our random variable X is measurable with respect to an isonormal Gaussian process W, associated 
with its canonical separable Hilbert space H. For illustrative purposes, one may further assume, as we now 
do, that W is the standard white- noise corresponding to iJ = L^([0, 1]), which is constructed using a 
standard Brownian motion on [0, 1], also denoted by W, endowed with its usual probability space (O, J", P). 
This means that the white noise W is defined by W (/) = /q / (s) dW (s) for any f £ H, where the stochastic 
integral is the Wiener integral of / with respect to the Wiener process W. If we denote Iq (/) = / for any 
non-random constant /, then for any integer n > 1 and any symmetric function / G H" , we let 

/„(/) :=n! / [■■■[" ' f{si,S2,--- ,s„)dWis,,)---dWis2)dW{s,), 

Jo JQ JQ 

where this integral is an iteration of n ltd integrals. It is called the nth multiple Wiener integral of / w.r.t. 
W, and the set n„ := {/„ if):f€ iJ"} is the nth Wiener chaos of W. Note that h (/) = W (/), and that 
E [/„ (/)] = for all n > 1. Again, see [171 Section 1.2] for the general definition of /„ and Hn when VF is a 
more general isonormal Gaussian process. The main representation theorem of the analysis on Wiener space 
is that L^ (ri, J^, P) is the direct sum of all the Wiener chaoses. In other words, X G L^ (fi, JF, P) if and only 
if there exists a sequence of non-random symmetric functions /„ G H" with X]^o ll/nll/f" < °° ^'^'^^ ^^^^ 
-^ ~ X]^o -^" (/")• Note that E [X] = /q. Moreover, the terms in this so-called Wiener chaos decomposition 
of X are orthogonal in L^ i^,J^, P), and we have the isometry property E [X^] = X]^o "' ll/nll_ffi- We are 
now in a position to define the Malliavin derivative D. 
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Definition 9 Let D^'^ be the subset of L'^ (f^, J^,P) formed by those X — X!^o-^^ (-^^0 such that 

oo 

The Malliavin derivative operator D is defined from D^'^ to L^ {fl x [0, 1]) by DX = if X ^ EX is 
non-random, and otherwise, for all r G [0, 1], by 

oo 

i?,X = ^n/„_i(/„(r,.)). 

n=l 

This can be understood as a Frechet derivative of X with respect to the Wiener process W. li X — W (/) 
then DX = f. Of note is the chain-rule formula D (F (X)) — F' {X) DX for any differentiable F with 
bounded derivative, and any X S D^'^. 

Definition 10 The generator of the Ornstein-Uhlenbeck semigroup L is defined as follows. Let X = 
'Yli^=i^n{fn) be a centered r.v. in L'^{Q). -(/ X^^i "'^"'' l/n| < ^O; then we define a new random vari- 
able LX in L^ (n) by —LX = X]?^i "-^n (/«)■ "^^^ pseudo-inverse of L operating on centered r.v. 's in L^ (Q,) 
is defined by the formula —L^^X = X^^i ~^ri (/«) • V ^ is not centered, we define its image by L and L^^ 
by applying them to X — 'EX . 

As explained in the introduction, for X £ D^'^, the random variable G :— (^DX;—DL^^X^ plays a 
crucial role to understand how X^s law compares to that of our reference random variable Z. The next 
lemma is the key to combining the solutions of Stein's equations with the Malliavin calculus. Its use to prove 
our main theorems relies heavily on the fact that these solutions have bounded derivatives. 

Lemma 11 (Theorem 3.1 in f^', Lemma 3.5 in 'SEj) Let X G D^'^ be a centered random variable with a 
density, and G = {DX; —DL^^X)h- For any deterministic, continuous and piecewise differentiable function 
m such that m' is bounded, 

'E,[Xm{X)]=E[m'{X)G]. 

3.2 General tail results 

The main theoretical results of this paper compare the tails of any two random variables X and Z , as we 
now state in the next two theorems. In terms of their usage, Z represents a reference random variable in 
these theorems; this can be seen from the fact that we have a better control in the theorems' assumption on 
the g* coming from Z than on the law of X. Also, will apply these theorems to a Pearson random variable 
Z in the next section, while there will be no restriction on X G D^'^ beyond the assumption of the theorems 
in the present section, we will see that all assumptions on Z in this section are satisfied when Z is a Pearson 
random variable. 

Theorem 12 Let Z be a centered random variable with a twice differentiable density over its support {a,b). 
Let g^, and Q be defined as in H^ and ^, respectively. Suppose that g'J{x) < 2, and ^ q() ho,s a finite 
limit as X -^ a and x -^ b. Let X G D^'^ be a centered random variable with a density, and whose support 
{a,bx) contains (a, 6). Let G be as in 0^. If G > g* {X) a.s., then for every z G (0,6), 

1 [^ 

P[X > z]>^^4z)-—— {2x-z)I>[X >x]dx. (21) 
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Proof. Taking expectations in Stein's equation (|S]), i.e. referring to Q, we have 

P[X <z]- V\Z < z] = E[5,(X)/'(X) - X!{X)\ 

which is equivalent to 

V{X >z\~ $,(z) - E[X/(X) - g,{X)r{X)\. 

Since g^.(X) ^ almost surely and f(x) < if x > z, 

P[X > z] - $,(z) = E[lx<.X/(X)] + E[lx>.X/(X)] - E[lx<..9,(X)/'(X)] - E[lx>..9*(X)/'(X)] 
> E[lx<.X/(X)] + E[lx>.X/(X)] - E[lx<..9*(X)/'(X)] 

Let rn(x) = [/(a) — /(z)]lx<a + [/(a^) — /(2)]la<i<z where the first term is if a = — oo. Note that m is 
continuous and piecewise differentiable. The derivative is m'(x) = f'{x)la<x<z except at x = a and x = z. 
We saw in Lemma[5]that /' is bounded. Therefore, since X S D^'^, we can use Lemma [TT] to conclude that 

[/(a) - /(z)]E[lx<a^] + n'i-a<x<zX{f{X) - f{z))] = n'i-a<x<zf'{X)G]. 

from which we derive 

E[ix<zXf{X)] - /(z)E[lx<.X] = mx<zf'{X)G]. 

Therefore, 

P[X > z] - $,(z) > {-E[lx<zf'{X)G] + f{z)E[lx<zX]} + I][lx>zXf{X)] - I][lx<z9*{X)f'iX)] 
= n^x<zf'{X){G - g,{X))] + f{z)-E[lx<zX] + E[lx>.X/(X)] 

> fiz)n'i-x<zX]+n^x>zXf{x)] 

= fiz)-E[lx<zX] + Il[lx>zX.f{X)] - f{z)-E[lx>zX] + f{z)-E[lx>zX] 
= f{z)E[X] + E[lx>zX{f{X) - /(z))] 
= E[lx>zXifiX)-fiz))] 

Write f{X) - f{z) = f'{0{X - z) for some random ^ > z {X > ^ also). Note that /'(^) < since ^ > z. 
We have P[X > z] - $,(z) > E[lx>zf'iOXiX - z)]. From LemmaEl 

Q[z) 

since from Lemma |51 Q is non-decreasing on (0,6). 

If we define S{z) := P[X > z], it is elementary to show (see [22]) that 

EllxyzXiX- z)]< I {2x- z)S{x)dx. 

J Z 

From V[X > z] - ^.(z) > E[lx>,/'(e)X(X - z)], 

V[X > z] > $,(z) - -— ^ / {2x - z)S{x) dx 
Q{z) A 

which is the statement of the theorem. 

Lastly the reader will check that the assumption that the supports of Z and X have the same left-endpoint 
is not a restriction: stated briefiy, this assumption is implied by the assumption G > 5* {X) a.s., because 
G = gx {X) and g* (resp. gx) has the same support as Z (resp. X). ■ 

To obtain a similar upper bound result, we will consider only asymptotic statements for z near b, and will 
need an assumption about the relative growth rate of g* and Q near b. We will see in the next section that 
this assumption is satisfied for all members of the Pearson class with four moments, although that section 
also contains a modification of the proof below which is more efficient when applied to the Pearson class. 
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Theorem 13 Assume all the conditions of Theorem {T^ hold, except for the support of X , which we now 
assume is contained in (a, b). Assume moreover that there exists c < 1 such that linisup2_j.t, g* (z) /Q (z) < c. 
If G < g<t {X) a.s., then there exists zq such that b > z > zq implies 

P[X>z] <J—^,{z). 
1 — c 

Proof. From Stein's equation ([8]), and its application ([9]), 

P[X >z]- <i>,(z) = nXfiX) - g-4X)f'{X)]. 

Since X S D'^'^, in Lemma 111 [ we can let to = / since / is continuous, differentiable everywhere except 
at a; = a and x = b, and from Lemma [S] has a bounded derivative. Therefore, 

P[X > z] - $,(z) = E[Gf'{X)] - E[g4X)f{X)] 

= E[/'(X)(G-5*W)] 

= nix<J'iX) (G - g^X))] + E[lx>,fiX) (G - g^X))] 

<nix>J'{X){G-g4X))] 

= E[lx>./'(X)E [G\X]] - E\\x>.f'{X)g^{X)\ 

where the last inequality follows from the assumption G — g*{X) < a.s. and if X < z, then f'{X) > 0. By 
Proposition 3.9 in [5], E [G|X] > a.s. Since f'{X) < if X > z, then by the last statement in Lemma[Sl 
and the assumption on the asymptotic behavior of g*/Q, for z large enough, 

P[X > z] - $,(z) < -E[lx>J'{X)g4X)] (22) 



< E 



i-X>z- 



'Q{x) 

< cP[X > z]. 
The theorem immediately follows. ■ 

4 Pearson Distributions 

By definition, the law of a random variable Z is a member of the Pearson family of distributions if Z's 

density p, is characterized by the differential equation p'^{z) / p.f{z) ~ (oiz + ao)/(az^ + /3z + 7) for z in its 

support {a,b), where —00 < a < b < 00. If furthermore E[Z] — 0, Stein (Theorem 1, p. 65 in |21) ) proved 

that (7* has a simple form; in fact, it is quadratic in its support. Specifically, g*{z) = az'^ + /3z + 7 for all 

z € (a, 6) if and only if 

P'M ^ {2a+l)z + /3 

p^{z) az^ + l3z + j' 

The Appendix contains a description of various cases of Pearson distributions, which are characterized 
by their first four moments, if they exist. In this section, we will operate under the following. 

Assumption PI Our Pearson random variable satisfies E [Z^] < 00 and z^p^{z) — > as z — > a and z — ^ 6. 

Remark 14 This assumption holds as soon as E [Z^] < cx), which, by Lemma W^ in the Appendix, holds if 
and only if a < 1/2. The existence of a second moment, by the same lemma, holds if and only if a < 1. 
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4.1 General comparisons to Pearson tails 

In preparation to stating corollaries to our main theorems, applicable to all Pearson Z's simultaneously, 
we begin by investigating the specific properties of g* and Q in the Pearson case. Because g*{z) = 
[az"^ + /3z + 7) l((j ;,')(z), we have the following observations; 

• Since g'^{z) — 2a on (a, 6), and a < 1 according to Remark UM it follows that g'^{z) < 2. 

• If z e (a, b), then 

Q{z) = z^ - zg^z) + g^z) = z'^ ~ z {2az + 13) + {az^ + /3z + 7) = (1 - a) z^ + 7 

and so Q{z) > Q{0) = 7 = 5* (0) > 0, where the last inequality is because g* is strictly positive on the 
interior of its support, which always contains 0. This is a quantitative confirmation of an observation 
made earlier about the positivity of Q in the general case. 

• As z — > a and z -^ b, 

z-g'M _ (1-2q)z^/? 

Q{z) (l-a)z2+7 

approaches a finite number in case a and b are finite. As \z\ -^ 00, the above ratio approaches 0. 

• We have E [Z^] = j-^- Again, this is consistent with 7 > and a < 1. 

Remark 15 The above observations collectively mean that all the assumptions of Theorem W^ are satisfied 
for our Pearson random variable Z , so we can state the following. 



Proposition 16 Let Z be a centered Pearson random variable satisfying Assumption PI. Let g^, be defined 
as in (01). Let X G D^'^ be a centered random variable with a density, and whose support {a,bx) contains 
(a, b). Suppose that G > 5* (X) a.s. Then for every z € (0, b), 



1 /■'' 

F[X > z] > $*(z) - ^^ / {2x - z)F[X > x] dx. (24) 

(l-a)z2+7 Jz 

We have a quantitatively precise statement on the relation between Var[Ar] and the Pearson parameters. 

Proposition 17 

L Assume that the conditions of Proposition [751 hold, particularly that G > g^:{X); assume the support 
{a,b) of g^ coincides with the support of X . Then 

Var[X] > ^— = Var[Z]. 
1 — a 

2. If we assume instead that G < g*{X) a.s., then the inequality above is reversed. 

Proof. Since X has a density, we can apply Lemma [TT] and let m{x) = x. 

YsiT[X] = E[Xm(A:)] = E[G] > E[g4X)] 

> E[g4X)] = E[la<x<b {aX^ + I3X + 7)] = aE[X^] + /3E[X] + 7 
(l-a)Var[A:] > /? • + 7 

VarfAT] > ^— = VarfZl. 
1 — a 

This proves point 1. Point 2 is done identically. ■ 

In order to formulate results that are specifically tailored to tail estimates, we now make the following. 
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Assumption P2 The right-hand cndpoint of our Pearson distribution's support is 6 = +00 

Remark 18 Assumption P2 leaves out Case 3 in the Appendix in our list of Pearson random variables, i.e. 
the case of Beta distributions. Therefore, inspecting the parameter values in the other 4 Pearson cases, we 
see that Assumption P2 implies a > 0, and also implies that if a = 0, then /3 > 0. 

Remark 19 In most of the results to follow, we will assume moreover that a < ^. By Lemma \28[ this 

is equivalent to requiring E \Z\ < 00, and more generally from the lemma, our Pearson distribution has 

moment of order m if and only if a < l/{m — 1). As mentioned, a < ^ thus implies Assumption PI. 
Consequently Theorem [73 implies the following. 

Corollary 20 Let Z be a centered Pearson random variable satisfying Assumption P2 (support of Z is 
(a, +00)/ Assume a < 1/2. Let g* be defined as in Op. Let X G D^'^ be a centered random variable with a 
density and support contained in (a, +00). If G < 5* {X) a.s., for any K > . ~" , there exists zq such that 
if z > zq, then 

P[X > z] <K ^^{z). 

Proof. Since 

5, {z) _ az^ + f3z + '-f 



Q{z) (l-a)z2+7 

then Umsup2_j.Q^ g* (z) /Q (z) = a/ (1 — a) < 1 if and only if a < ^. Therefore, Theorem 1131 applies in this 
case, and with the c defined in that theorem, we may take here any c > a/ (1 — a), so that we may take any 
K = 1/(1 — c) as announced. ■ 

The drawback of our general lower bound theorems so far is that their statements are somewhat implicit. 
Our next effort is to fix this problem in the specific case of a Pearson Z: we strengthen Proposition [TBI so 
that the tail P [X > z] only appears in the left-hand side of the lower bound inequality, making the bound 
explicit. The cost for this is an additional regularity and integrability assumption, whose scope we also 
discuss. 

Corollary 21 Assume that the conditions of Proposition \16\ hold; in particular, assume X G D^'^ and 
G > aX^ + j3X + 7 a.s. In addition, assume there exists a constant c > 2 such that P [X > z] < zp (z) /c 
holds for large z (where p is the density of X). Then for large z, 

P\X >z]> (^-^)Q(^) ^Jz) « (^-2)-a(c-2) 

^ ^- {c-2)Q{z) + 2z^*^^ c-a(c-2) *^>- 

The existence of such a c > 2 above is guaranteed if we assume g{z) < z^ jc for large z, where g (x) := 
E [G|X — x] (or equivalently, g defined in ^B^). Moreover, this holds automatically if G < g^^ {X) a.s. for 
some quadratic function g^ {x) = ax"^ + jSx + 7 with a < 1/2. 



Proof. Since z > 0, we can replace 2a; — z by 2x in the integral of (|24|) . 

/OO -1 /"OO 

xS{x)dx<- x^\S'{x)\dx 

/ /"GO 

z 5(z) — lim X S{x)~^2 \ xS{x)dx 



<-(z^S{z) + 2F{z)) 
c 

F{z) < -^z'S{z) 
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Therefore 



S{z) = P[X >z]> Mz) - j^^F{z) > M^) - y S{z) 

Q(z) [c-2)Q{z) 



S{z) 



2z 
1 



(c-2)Q(z) 



>^*{z) 



{c^2)Q{z) ^ {c-2){l-a) 

^^'>- {c-2)Q{z) + 2z^^*^'>^ {c-2){l-a) + 2^*^'>- 

This proves the inequality of the Corollary. 

To prove the second statement, recall that ^6| Theorem 3.1] showed that 

xp{x) dx 

P-a.s. It is also noted therein that the support of p is an interval since X e D^'^. Therefore, 

z 1 f°° X f°° 

—p{z)^—9{z)p{z)^ I —p{x)dx> / p(x)dx 
c z Jz z J^ 

a.s. This finishes the proof of the corollary. ■ 

4.2 Comparisons in specific scales 

In this section and the next, we will always assume X G D^'^ is a centered random variable with a density 
and with support (a, oo), and we will continue to denote by g the function defined by g {x) := E [G\X = x], 
or equivalently, defined in ([3]). 

We can exploit the specific asymptotic behavior of the tail of the various Pearson distributions, via 
Lemma [57] in the Appendix, to draw sharp conclusions about X's tail. For instance, if g is comparable to a 
Pearson distribution's 5, with a 7^ 0, we get a power decay for the tail (Corollary [22] below), while if a is 
zero and /3 is not, we get comparisons to exponential-type or gamma-type tails (Corollarv l23l below). In both 
cases, when upper and lower bounds on G occur with the same a on both sides, we get sharp asymptotics 
for X's tail, up to multiplicative constants. 

Corollary 22 Let g, (x) := ax^ + j3x + 7 and g^ (x) := ax'^ -\- j5x + j be two functions corresponding to 
Pearson distributions (e.g. via 0)j where < a < a < 1/2. 

1. If g (x) < cjf, [x) for all x > a, then there is a constant c^ [a, f3,^j > such that for large z, 

L J — ^l + l/a 

2. If g* (x) < g (x) < g* (x) for all x > a, then there are constants Cu (a,/3,7) > and ci (a,a,/3,7) > 
such that for large z, 

""' < P[X > z] < ''^ 



Proof. Let <&*a,^,7 and <&H.a J,7 be the probability tails of the Pearson distributions corresponding to 
5* and 5* respectively. We can prove Point 1 by using Corollary [50] and Lemma [57] There is a constant 
ku (a, ,5, 7) > such that, for any K > jr^j fo^' large z, 

P[X>z]<A^$,,,^,,(z)<X.-^. 
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The upper bound in Point 2 follows directly from Point 1 because of the condition g (x) < 5, (x) . This same 
condition also allows us to give a lower bound for P[X > z]. Fix any c G (2, l/a). By Corollary [?T] and 
Lemma [571 there is a constant ki (a,/3,7) > such that for large z, 

nx > .] > ^''-'^-r^\-'^'^..^,. (^) > (^ - 2) - " (^ ' 2) ^' 



c-a{c-2) -*^^P^^^-^- c-a{c-2) z^+Va ' 

■ 

Corollary 23 Let g^, [x) := (/3x + 7) , and g^, (x) = (/3x + 7) be two functions corresponding to Pearson 
distributions (e.g. via ^) where /3,/3,7,7 > and a — —^/p. 

1. If g (x) < 5, (x) for all x, then there is a constant Cu (/?, 7) > such that for large z, 

V\X > z\< c„z-i-^/^'e- 



-1-7/^%-z/^ 



^- ^1 9* (2;) < 5 (a;) < 5* (a;) for all x, then there are constants Cu (/3, 7) > Q {(3, 7) > such that for large 

ci z-i-^//5'e-^/^ < P[X >z]< c„z-i-^/^'e-^/^. 

Proof. Let $*/3,7 and $,« ;^ be as in the proof of the previous corollary, noting here that a = a — 0. 
The proof of Point 1 is similar to the proof of Point 1 in Corollary [22l The upper bound in Point 2 follows 
from Point 1 above and Point 1 of Corollary [22] For the lower bound of Point 2, if we fix any c > 2, then by 
Corollary [5T] and Lemma [571 there is a constant fc; (/?,7) > such that for large z, 

P[X >z]> ^^$,^,, >^^k z-i-^/^'e-^/^. 



Remark 24 The above corollary improves on a recently published estimate: in 116, Theorem 4--i], it was 
proved that if the law of X £ D^^^ has a density and if g{X) < j3X + 7 a.s. (with /3 > and 7 > 0), then 
for all z > 0, P[X > z] < exp I —jEz+T' ) ■ Using g^, (z) — {/Sz + 7) 1 , Point 1 in Corollary \23\ gives us an 
asymptotically better upper bound, with exponential rate e~^'^ instead o/e~^"^. Our rate is sharp, since 
our upper bound has the same exponential asymptotics as the corresponding Pearson tail, which is a Gamma 
tail. 

4.3 Asymptotic results 

Point 2 of Corollary [531 shows the precise behavior, up to a possibly different leading power term which 
is negligible compared to the exponential, of any random variable in D^'^ whose function g is equal to a 
Pearson function up to some uncertainty on the 7 value. More generally, one can ask about tail asymptotics 
for X when g is asymptotically linear, or even asymptotically quadratic. Asymptotic assumptions on g are 
not as strong as assuming bounds on g which are uniform in the support of X, and one cannot expect them 
to imply statements that are as strong as in the previous subsection. We now see that in order to prove 
tail asymptotics under asymptotic assumptions, it seems preferable to revert to the techniques developed in 
|22j . We first propose upper bound results for tail asymptotics, which follow from Point 1 of Corollary [22] 
and Point 1 of Corollary [23l Then for full asymptotics. Point 2 of each of these corollaries do not seem to 
be sufficient, while j22} Corollary 4.5] can be applied immediately. Recall that in what follows X £ D^'^ is 
centered, has a density, and support (a, 00), and g is defined by g (x) := E [G\X = x], or equivalently, by ([3]). 
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Proposition 25 

1. Suppose linisup^^+^ g (z) /z^ = a e (0, 1/2). Then limsup^_^+^ inP[x>z] < _ (i + i) . 

2. Suppose limsup^^^g^ g (z) /z — 13 > 0. Then linisup^_^^_j3^ — — ^ — £i < — 4. 

Proof. Fix £ e (0,1/2 — a). Then g (cc) < {a + e)x'^ if a; is large enough. Therefore, there exists a 
constant 7£ > such that g (x) < (a + e) x^ + 7^ for all x. Let Z^ be the Pearson random variable for which 
5, (z) = {a + s) z'^ + 7e- This falls under Case 5 in Appendix 15.21 so its support is (—00,00), which then 
contains the support of X. From Point 1 of Corollary [2H there is a constant c^ depending on e such that 
for z large enough, 

P[X > z] < CeZ-'^-^. 



We then have 



lnP[X > z] < Ince - ( IH ) Inz, 



a + e 



lnPfX>zl Inc. / 1 

< -, — - - 1 



In z In z \ a + e 

lnP[X>zl / 1 
hm sup i < - IH 

z^oo Inz \ a + 

Since e can be arbitrarily close to 0, Point 1 of the corollary is proved. The proof of Point 2 is entirely 
similar, following from Corollarv l23[ which refers to Case 2 of the Pearson distributions given in Appendix 
15.21 This corollary could also be established by using results from [22 . ■ 

Our final result gives full tail asymptotics. Note that it is not restricted to linear and quadratic behaviors. 
Theorem 26 

1. Suppose lim^^+oo .9 (z) /z^ = ae (0, 1). Then lim^^+^o '"^iL^^^^ ^ ~ {^ + a) ■ 

2. Suppose limz^+oo .9 (z) /z^ = /3 > for some p e [0, 1). Then lim^^+oo '"^i^p^^ = - ^ (2^-^) ■ 

Proof. Since for any e S (0, min(a, 1 — a)), there exists zq such that z > zq implies [a ~ e) z'^ < g (z) < 
(a + £)z^, the assumptions of Points 2 and 4 (a) in [521 Corollary 4.5] are satisfied, and Point 1 of the 
Theorem follows easily. Point 2 of the Theorem follows identically, by invoking Points 3 and 4 (b) in [22l 
Corollary 4.5]. All details are left to the reader. ■ 
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5 Appendix 

5.1 Proofs of lemmas 

Proof of lemma [TJ Proof of point 1. If < a; < 6, then clearly 5*(a;) > 0. If a < x < 0, we claim 
that (7*(x) > still. Suppose we have the opposite: g*(a:) < 0. Then / yp^:{y)dy = g^:{x)p^{x) < 0. Since 
Ja yP*iy) ^y < Oi then /^ yp*{y) dy < 0, contradicting E[Z] ~ 0. Thus, g*{x) > for all x, and g*{x) > if 
and only if a < x < &. 

Proof of point 2. Trivial. 

Proof of point 3. This is immediate since lim g* (x)p* (x) ~ 1™/ yp„{y)dy — E[Z] and similarly for 

liing^,{x)p^,{x) = — E[Z]. ■ 

Proof of Lemma [2j It is easy to verify that dTTI) and P^ are solutions to Stein's equation ([5]). To 
show that they are the same, let ip{z) :~ gf,{z)p^{z) = J wp, (w) dw for z G (a, b). Then 

ip'{z) zp^{z) z 



ip{z) g^{z)p^{z) g^{z)' 

Integrating over {y,x) C (a, b) leads to 



g*{z) ifix) g^{x)p^{x 

and so 



' d,^ log 44 = log ^44^ (25) 



eh tjii 1 g*{y)p*{y) p*{y) 



9*{y) g*{y) 9*{x)p*{x) g^{x)p^{x)' 

The derivative formula (|14p comes via an immediate calculation 

m = -N4^ r{Ky)-nKz)])pMdy' (m-)-e[m^)])p4^) 



[g^{x)p4x)Y Ja g*{x)p*{x) 

-xp^x) f%^^ ^ hixl~B\h{Z)l 

/i(x) - E[/i(Z)] 



(/i(y)-E[;i(Z)])p,(y)dy 



[5*(x)]2p,(x) y„ g*(x) 

■ 
Proof of Lemma \S[ From ((25|) in the previous proof, we have 

dz = hm / — -— dz = hm log * * = g,(0)p*(0) — hm log [.g*(x)p»(x)] = oo 



/o 5*(z) Xy^bjQ g*{z) x/'b g^,{x)p,,{x) x/'b 

and 

/■" z /"" -2 gJx)pJx) 

/ — r^ rfz = lim / — — - dz == lim log * , * , = lim log [gH.(x)p,(a;)] - .9,(0)p*(0) == -oo. 

7a 5*(-Z) ^^^Jx y*\z) 3;\a 5*(0)p,(0) 2;\a 

■ 

Proof of Lemma [71 We prove ([T7)) first. It is trivially true if x ^ [a, 6], so suppose x € (a, h). Let 

m(x) :^ $4x) „;* • g*(x)p,(x). 

Q(x) 

By a standard calculus proof, we will show that 7n'(x) < so that mix) > lim m{y). The result follows 

after observing that lim r7i(y) = 0. This is true since lim g*{y)p*{y) — and lim ^^(x) = 0. Now we show 

y^b y~+b y^b 

that to'(x) < 0. 
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m'{x) — —pf,{x) — g*(a;)p*(x) 



x-g'^x) 



x^ ~xg'^{x) +g*{x) 



Q{x) 



[9*{x)p.^{x)]' 



g*p* 



(x (x - g'J + g,) (1 - g':) - (x - gl) {2x - xg':) 



X- 9* 



-xp^ 



Q' 



-m 



-Q2 _ g^ [(a, _ g'j (a; _ j; g'^ _ 2x + x^:') + ff* ( 1 - g';)] + Qx{x~ gi) 

-x'^{x-g'^) - 2xg^ {x - g'^) ~ gl + xg^ {x - g'^) - gl {I - g'l) 
x^{x-g'^) +xg^{x-g'^) 

^-gl~9l{i-g':)^-9l{'^-9':)<^- 

To prove p^ (again, it suffices to prove this for x £ (a, 6)), let 

n(a;) := 1 - $»(a;) * g*{x)p^{x) = 1 - m{x) 

Q\x) 

so n'(a;) = —m'{x) > 0. n is then nondecreasing so n{x) > hm n{x) — 0. 

a;— ^a 

Now we prove P^ . If x > 0, 

^*{x) = p*{y)dy<- yp^{y)dy ^ - ■ g^{x)p^{x). 

J X X J ^ X 

On the other hand, if a; < 0, 

l-$*(a:;)=/ p*{y)dy<- yp^{y)dy^ 5,(x)p,(a;). 

J — CO X J_oo ■^ 

This proves (gHl)- ■ 

Proof of last bullet point on page 1141 We replicate here the method commonly used to find a 
recursive formula for the moments. See for example [7] and [TH]- Cross- multiplying the terms in (|23p . 
multiplying by x^ and integrating over the support gives us 

b fh 

[(2a + l)z"+i + Pz-"] p,(z) dz^ {az'-+^ + pz'-+^ + 72'') p',{z) dz 

J a 

-(2a + 1)E [Z-^+i] - ;3E [Z^ = (az''+2 + /3z''+i + 72'') p,{z)\\ 

[a{r + 2)z'-+i + (3{r + l)z'- + 7rz'^-i] p,{z) dz 

(2a + 1)E [^'■+1] + ^E [Z''] = a{r + 2)E [Z-^+i] + (3{r + 1)E [Z^ + 7rE [Z''-^] 

where we assumed that z''+^/9,(z) -> at the endpoints a and b of the support. For the case r = 1, this 
reduces to z'^ p^{z) — > at the endpoints a and 6, which we are assuming. Therefore, 

(2a + 1)E [Z^] + /3E [Z] = 3aE [Z^] + 2/3E [Z] + 7E [Z"] . 

Since E [Z] = and E [Z"] = 1, this gives E [Z^] = ^. ■ 

5.2 Examples of Pearson distributions 

We present cases of Pearson distributions depending on the degree and number of zeroes of g<t{x) as a 
quadratic polynomial in (a, &). The Pearson family is closed under affinc transformations of the random 
variable, so we can limit our focus on the five special cases below. The constant C in each case represents 
the normalization constant. See Diaconis and Zabell [5] for a discussion of these cases. 
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• CASE 1. If dcgg*(2:) = 0, p, can be (after an afiine transformation) written in the form p^,{z) = Cer^ /^ 
for —(X) < z < oo. This is the standard normal density, and C = 1/^/2tt. For this case, g*{z) = 1. 
Consequently, Q{z) = z^ + 1. If z > 0, the inequalities (IT71) and (IT5)) of Lemma[7]can be written 



e--'/2 < ^,{z) < —. 



{Z^ + 1)V^' - -' ' - zy^^ 

a standard inequality involving the tail of the standard normal distribution. 

CASE 2. If deg(7»(z) = 1, p* can be written in the form pr{z) — Cz'^^^e^^^'^ for < z < oo, with 
parameters r,s > 0. This is a Gamma density, and C = l/[sT(r)]. It has mean fi ~ rs > and 
variance rs^. If one wants to make Z centered, the density takes the form p*(z) = C(z + p)'"^^e^^^+^-'/'* 
for —/I < z < oo. For this case, g*{z) = s{z + /i)+. 

CASE 3. If degg^,{x) — 2 and g^, has two real roots, p* can be written in the form p*(a;) = Cx^~^{l — 
xy~^ for < a: < 1, with parameters r,s > 0. This is a Beta density, and C = l//3(r, s). It 
has mean /i = r/(r + s) > and variance rs/[{r + s)^(r + s + 1)]. Centering the density gives 
p*(x) = C{x + iJ.Y^^{l — X — iJ,y~^ for — /i < X < I ~ fi. For this case, g^{x) = 7x^(2; + m)(1 ~ ^ ~ m) 
when —p,<x< 1 — /i and elsewhere. 

CASE 4. If degg,(a;) — 2 and 5* has exactly one real root, p, can be written in the form p*{x) — 
Cx~'^e~^'^ for < a; < 00, with parameters r > 1 and s > 0. The normalization constant is 
C = rfr-i) ■ If '^ > 2, it has mean p = s/(r - 2) > 0. If r > 3, it has variance s^r(r - 3)/r(r - 1). 
Centering this density yields p*(x) = C{x + p)^'"e^''/(^+'^' for — p < a; < 00, and assume that r > 3. 
For this case, g*(x) = 7:12 (*^ ^ /^)^ when — p < x and elsewhere. 
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CASE 5. If degg*(a:) = 2 and g* has no real roots, p*{x) = C (l + x^) '^ gs(arctanx) j-^j, _qq < a; < 00, 
with parameters r > 1/2 and —00 < s < cx). The normalization constant is C = ^t'( -1/2 
lir > 1, it has mean p = s/ [2(r - 1)]. Ifr > 3/2, it has variance 4 (r - 1)^ + s"^ / [4(r - l)2(2r-3)] 
The centered form of the density is p, (a;) — C 1 + (a; + p) 



gS(arctan(a;+Ai)) f^j. _qq < a; < OO, and 



assuming that r > 3/2. For this case, g*{x) — „> ^.> 1 + (x + p) 



2(r-l) 
2(r-l) ' ^ ~ r-1 ""^-^ ' ^ 2(r-l) ' 



F^^' P - TTTT and 7 - 2(7:3^^ 



Using our original notation. 



5.3 Other Lemmas 

Lemma 27 Let Z be a centered Pearson random variable. Then there exist constants ku > ki > depending 
only on a,f3,j such that when z is large enough, we have the following inequalities. 

1. If a ^0 and [3 > 0, 

< $, (z) < 



2. If a > 0, when z is large enough, 

3. Assuming Z 's support extends to —00, if a> 0, when z < and \z\ is large enough, 

< 1 - $, (z) < 



|^|l + l/a - »W-|^|i + i/„. 
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Proof. For the proof of this lemma, which is presumably well-known, but is included for completeness, 
we will use C for the normalization constant of each density to be considered. 

In Point 1, let /i = 7//? > 0. Then Z has support (— /i, 00); see Case 2 in Appendix 15.21 In its support, 

Z has g^ {z) — (3z + "f — 13 {z + /i) and density 



p^^{^)^C{z + ^,)-^'^-'e^v{-"^) 



Note that 



lim z^'/^e'^l^g^ (z) p^ (z) = C/3 lim 



yt^/fi 



{z + p.) 



f^/p 



exp 



Z Z + fi 



C/3e- 



-P//9 



From Lemma [71 



z — P 1 

5* (z) p* (2) < $* (z) < -g* (z) p* (z) 



z^ +7 



so 



C/3e-''/^ < liminf z^+^/'^e^/'^*, (z) < limsupz^+'^/'^e^/'^*, (z) < C/^e"^/''. 

2^00 z-i-oo 

Therefore, we can choose some constants ku (/3, 7) > ci (/3, 7) > such that when z is large enough. 



■,l+f,/P(^z/P 



< $* (2) < 



■yl+f,/l3f^z/l3- 



To prove Point 2, we first show that lim2_j.oo z^'^g* (z) p, (z) is a finite number ii". We consider the cases 
4a7 — /3^ = and 4a7 — /3^ > separately. We need not consider 4q7 — /3^ < since it corresponds to Case 
3 in Appendix 15.21 for which the right endpoint of the support of Z is 6 < 00 and so necessarily a < 0. 

Supose that 4a7 — /3^ = and let ^ = ^ > 0. Then az^ + /3z + 7 = a (z + /x) has one real root and the 
support of Z is (— /i, 00); see Case 4 in Appendix 15.21 In its support, Z has g* (z) — a{z + fi) and density 

p^ (z) = C {z + fi)^ '"exp 



z + fi 



where s = p./ a = (3/ (2a^) . Therefore, 



lim z^'"5, {z) p* {z) = Ca lim 



,1/c 



(z + /x) 



1/a 



exp 



z + p 



= Ca. 



Now suppose that (5^ := (4a7 — /?^) / (4a^) > so az^ + /3z + 7 has two imaginary roots and the support 
of Z is (—00, cx)). Letting p — j3 / (2a) allows us to write g^, (z) = a (z + /i) + aS"^ and the density of Z as 



— - arctan — - — 
ad \ 



p., (z) =C {z + pf + 6'^ " exp 

a slight variation of the density in Case 5 in Appendix 15.21 Note than in our present case. 



lim z ' "5* (z) p, (z) = Ca lim 
From Lemma [71 



(z + Ai)' + (52 

(l~2a)z-/? 



— exp 



— - arctan — - — 
ad \ 



Caexp 



pn 



2aS 



(1 — a) z^ + 7 
From these bounds we conclude 

.l-2a 



{z) p* (z) < $, (z) < -g, (z) p, (z) . 

z 



isT- 



1-a 



< liminf zi+i/"$* (z) < limsupz^+i/"*, (z) < K. 
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Therefore, when z is large enough, for some constants fc„ (a, /3, 7) > fc; (a, /3, 7) > 0, 



To prove Point 3, we consider Case 5 again. 



hm \z\ '" g* (z) p* (z) = Ca hm 



,1/a 



2—^ — 00 



— exp 



The conclusion follows similarly after using Lemma [7] when z < 

(l-2a)|z|-/3 



—- arctan ; — 



1 



2 5*(^)P*(2) < l-**(^) < ^75*(^)P*(2)• 
(l-a)|z^+7 \z\ 



= Ca exp 



/iTT 

2^ 



Lemma 28 Let Z be a centered Pearson random variable. If a < 0, all moments of positive order exist. If 
a > 0, the moment of order m exists if and only if m < 1 + 1/a. 

Proof. The random variables in Case 1 {a — /3 — 0) oi Appendix 15.21 are Normal, while those in Case 3 
(a < 0) have finite intervals for support. It suffices to consider the cases where a = and /? > 0, and where 
a > 0. Let m > 0. We will use the fact that E [|Z|'"] < 00 if and only if X;^^! """^P [\Z\ >n]<oo. 

If a = and (3 > 0, and Z is supported over (a, 00), then by Lemma [27l E [1^1™] < 00 if and only if 



■c-^ n" 



n=l 



< 00, 



which is always the case. 

Now suppose a > 0. Since P [\Z\ > n] = $* (n) + 1 — ^^ (—n), thcon by Lemma [27] again, E [\Z\"^] < 00 
if and only if 



°° „m-l °° 1 



n—1 n—1 

This is the case if and only if 2 + 1/a — m > 1, i.e. m < 1 + 1/a. 



< 00. 
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